091-2230-8145     |      dataprojectng@gmail.com

COMPUTATIONAL INFERENCE TECHNIQUE FOR MINING STRUCTURED MOTIFS

  • Project Research
  • 1-5 Chapters
  • Abstract : Available
  • Table of Content: Available
  • Reference Style: APA
  • Recommended for : Student Researchers
  • NGN 3000

ABSTRACT

One of the major challenges in bioinformatics is the development of efficient computational tools for mining patterns. Structured motifs, like DNA binding sites in organisms with peculiarities in their genomic sequence like malaria parasite, Plasmodium falciparum have not been mined by existing structured motifs extraction tools. There is a need to develop faster computational tools to mine these DNA binding sites which are viable drug targets. This work was aimed at developing an algorithm for mining structured motifs in the genome of P. falciparum. The Gene Enrichment Motif Searching (GEMS) method for mining simple motifs was modified by incorporating the time efficient implementation of the suffix tree data structure with suffix links. This enables an improved searching speed, while adding an optimized position-weight matrix computation using the hypergeometric-based scoring function. This algorithm, Suffix Tree Gene Enrichment Motif Searching (STGEMS) was implemented in C programming language on Linux platform. An empirical evaluation of the sensitivity of STGEMS was conducted by comparing the similarity check mechanism of the GEMS algorithm for mining simple motifs with that used in another popular algorithm for extracting structured motifs, a Multi-Objective Genetic Algorithm Motif Discovery (MOGAMOD). The output of STGEMS algorithm was validated by comparing the motifs discovered with those obtained using biological experiments. A further validation was done by applying the STGEMS and GEMS algorithm to selected metabolic pathways and the results were compared. The STGEMS algorithm was tested with four sets of genes from the intraerythrocytic development cycle of P. falciparum. The speed of execution was evaluated using three simple motif discovery tools: Expectation Maximization Motif Elicitation(MEME), Gene Enrichment Motif Search (GEMS), and WEEDER as well as two structured motif discovery tools: RISOTTO and EXMOTIF on four different gene sizes.The high sensitivity of STGEMS in mining structured motifs from sequences in P. falciparum was proven empirically by its ability to identify 91% of the motifs in the sequences while MOGAMOD could not identify any motif. This validated the high sensitivity of the similarity check mechanism employed, in contrast with that used in MOGAMOD. The STGEMS algorithm identified 90% of the binding sites in P. falciparum which were similar to those obtained in biological experiments. On the selected metabolic pathways, STGEMS discovered all the simple motifs identified by GEMS, in addition to the structured motifs which GEMS could not identify. The empirical runtimes of STGEMS, MEME, WEEDER, GEMS, RISOTTO and EXMOTIF were respectively 20, 35, 26, 25, 28, 30 seconds for 20,000 base pair (bp), 32, 43, 44, 45, 42, 40 seconds for 40,000 bp, 41, 55, 56, 55, 52, 50 seconds for 60,000 bp and 54, 68, 69, 65, 67, 61 seconds for 80,000 bp respectively. The proposition resulted in a linear asymptotic runtime of O(N) at each iteration of the algorithm. The suffix tree gene enrichment motif searching algorithm developed was time efficient and successful in mining structured motifs like DNA binding sites in Plasmodium falciparum. This will aid a faster drug target discovery pipeline for the design of effective anti malaria drugs.




FIND OTHER RELATED TOPICS


Related Project Materials

COMPUTERIZED TRANSCRIPT MANAGEMENT SYSTEM A CASE STUDY OF CARITAS UNIVERSITY

ABSTRACT

This project is a computerized information management for transcript management which will help to over-come the undesirable pro...

Read more
RELATIONSHIP BETWEEN TEACHERS SELF- EFFICACY APPLICATION PACKAGE AND CLASSROOM PRACTICE IN SENIOR SECONDARY SCHOOLS TEACHERS IN ZARIA METROPOLIS, KADUNA STATE NIGERIA

ABSTRACT

This research focused on the relationship between teachers’ self-efficacy application package and classroom practices in s...

Read more
ASSESSMENT_OF_THE_PHYTOCHEMICAL_CONSTITUENTS_AND_PROXIMATE_COMPOSITION_OF_AFRICAN_PEER

Statement of the Problem

It is now known that agricultural materials are used as animal feeds and that they contain phytochemicals. These...

Read more
THE EFFECT OF ADVERTISING ON CONSUMPTION OF FAST FOOD IN NIGERIA

ABSTRACT

The roles of advertising on the consumption of fast food cannot be over emphasized. Advertisin...

Read more
TRAINING AS A VERITABLE TOOL FOR THE IMPROVEMENT OF SECRETARIAL EFFICIENCY IN AN ORGANIZATION

 

ABSTRACT

This Project work is “An Analysis of Training as Tool for improvement of secretarial Eff...

Read more
INFLUENCE OF HOME BACKGROUND AS CORRELATES OF JOB PERFORMANCE IN PUBLIC SECONDARY SCHOOL IN MBAITOLI L.G.A OF IMO STATE

Background To The Study

Through teaching, research, and community service, educational institutions all over the globe s...

Read more
MOTIVATION AND ITS EFFECT ON EMPLOYEE PERFORMANCE

BACKGROUND OF THE STUDY

It is known fact that the principal motive of management of any organization is...

Read more
NUTRIENT UTILISATION AND GROWTH PERFORMANCE OF CLARIAS GARIEPINUS FED DIFFERENTLY PROCESSED MUCUNA UTILIS MEALS AS A REPLACEMENT FOR SOYBEAN-BASED DIET

ABSTRACT

High cost of feed and competition between fish and other livestock‟s feed industries necessitate research into low cost, non-con...

Read more
A COMPARATIVE REVIEW OF PUBLIC RELATIONS FUNCTIONS IN THE NIGERIAN BANKING SYSTEM AND GOVERNMENT PARASTATALS

EXCERPT FROM THE STUDY

Public relations are the art and social science of that link inside and outside both public and p...

Read more
THE PERCEIVED EFFECTIVENESS OF PUNISHMENT ADMINISTERED ON STUDENTS BY SECONDARY SCHOOL TEACHERS.

BACKGROUND OF THE STUDY

In Nigeria today, acts of delinquency and violence among secondary school students have become a...

Read more
Share this page with your friends




whatsapp